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Summary. Stellar population synthesis codes provide the description of the total 
luminosity resulting from the combination of an ensemble of sources as a function 
of given physical parameters. Their final aim is to enable users to make inferences 
about the physical parameters (star formation history, mass transformed into stars, 
etc.) from the light observed in stellar clusters or galaxies. However, synthesis codes 
results cannot be interpreted in an one-to-one relation between an observed total 
luminosity and a theoretical model result since (a) we have not access to the intimate 
composition of the ensemble of stars and (b) the very theoretical method implicit 
in populations synthesis makes use of (probabilistic) distributions and there is not 
an unique model result that describes the integrated luminosity of an ensemble. 
In general, synthesis models provide the mean value of the distribution of possible 
integrated luminosities, this distribution (and not only its mean value) being the 
actual description of the integrated luminosity. 

Therefore, to obtain the closest model to an observation only provides confi- 
dence about the precision of such a fit, but not information about the accuracy of 
the result. In this contribution we show how to overcome this drawback and we 
propose the use of the theoretical mean-averaged dispersion that can be produced 
by synthesis models as a metric of fitting to infer accurate physical parameters of 
observed systems. 



1 Introduction 

Stellar populations synthesis models have been widely used to make inferences 
about the evolutionary status of observed systems by means of a comparison 
between observations and model results. In general it is assumed that the 
observed system is well mixed in terms of stellar populations and that mod- 
els provide an one-to-one relation between the integrated luminosity and the 
physical conditions under the model is performed. In other words, it is assumed 
that the distribution of the possible integrated luminosities of a cluster with 
given physical conditions (including its number of stars) is a Dirac's delta-like 
distribution. 
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However, this assumption is not compatible with the very method implicit 
in synthesis codes, when they make use of probability distributions, like the 
Initial Mass Function, to describe the contribution of different stellar types to 
the integrated light. In the most optimistic case, and following the central limit 
theorem, the distribution of integrated luminosities must follow a gaussian like 
probability distribution in the asymptotic case of complete well mixed stellar 
types in the cluster (i.e. theoretical clusters with infinite number of stars). See 
[5] for more details. 

In this contribution we discuss the distributed nature of theoretical inte- 
grated luminosities of stellar clusters. We take advantage of such distributions 
to propose a metric that allows to establish the accuracy of fits of observations 
with models. We will assume that models are not affected by systematic errors 
(but see [4, 6] for detailed discussions on this subject), and we will consider 
here only the intrinsic dispersion of the theoretical integrated luminosities. 

2 Understanding synthesis models 

The first step needed to use population synthesis models is to understand how 
synthesis codes work. The main goal of synthesis codes is to obtain the inte- 
grated luminosity resulting from an ensemble of individual stars with given 
physical conditions (initial mass function IMF, metallicity Z, star formation 
history SFH, age t...). Let us assume that n individual stars have been ob- 
served in a resolution clement of a telescope (e.g. a pixel, a slit, an IFU, a set 
of several pixels, etc.). Since each star has an actual luminosity £j, the total 
luminosity Ltot in the resolution element is: 

n 1 n 

L tot = Yj i = n-Yj i = n <£>. (1) 
i=i i=i 

where we have rewritten L to t as a function of n and < I > is the estimation of 
the mean luminosity of an individual star that represents the whole ensemble 
for the actual physical conditions. 

However, we cannot perform such a sum and obtain the actual integrated 
luminosity of this set of individual stars since these stars are not resolved and 
we do not know the individual li values. Instead, we can compute the actual 
mean luminosity of all the ensembles with the same physical conditions: It 
should be the composition of iV c i ass different stellar populations (or different 
stellar types) for the given physical conditions. If we assume that a given stellar 
type "j" can be represented by a luminosity lj and we know the probability 
uij to have a star with a given stellar type determinated by the given physical 
conditions (wj — w } -.[t, Z, IMF, SFH]), then a total luminosity of the system 
will be 
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L£~(n)=n Y,w j l J =n<£> 



(2) 



Vclaas 

J2 w i = L 



(3) 



3=1 



where < £ > is the luminosity of an average star that represents the ensemble 
for the given physical conditions. Synthesis codes provide < I > with a suitable 
normalization, like the one in Eq. 3. 

Obviously, the average integrated luminosity of stellar clusters with n stars 
is n times the luminosity of this average star, but the possible integrated 
luminosity of a cluster is a distributed quantity. In fact, Ll^°(n) in Eq. 2 does 
not provide the whole description of the possible integrated luminosity of an 
observation of n unresolved stars, but only the mean value of the underlying 
probability distribution that describes the possible integrated luminosities of 
all observations that contain n unresolved stars. In the following we will use 
< Ll^°(n) > instead of L^f°(n) to avoid confusion about this point. 

Since we know that the theoretical integrated luminosity is a distributed 
quantity, we can go further and also obtain other quantities, besides the mean, 
which describe this distribution. In particular, the variance (which depends 
on how many unresolved stars are in the resolution element) and the variance 
normalized to the mean integrated luminosity (which is independent on the 
number of stars), we refer [5] for more details in this subject 1 . 



Summing up, synthesis codes provide the luminosity of an average star 
or cluster (depending on the normalization used in Eq. 3) that describes the 
mean luminosity of the ensemble for given physical conditions. Codes can also 
provide the dispersion of the integrated luminosities around the mean by the 
use of Eq. 4 or Eq. 5. 

In the following sections we show two different uses of the dispersion as 
metrics for goodness-of-fit in stellar population studies. We will assume that 
the theoretical probability distributions of integrated luminosities follow a 
gaussian distribution (see [5] for reliable criteria of gaussianity of synthesis 
model results). 



1 Eq. 5 is related to Surface Brightness Fluctuations (SBF) when they are consid- 
ered as a description of the variance of synthesis models; see [1, 2, 3, 7] for more 
details. 





o- 2 [L£t°(n)} a\l) 



(5) 



<L£T(n)> <?■>' 
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3 Studying stellar populations with synthesis models 

Let us move now to the application of synthesis models to study stellar popu- 
lations in galaxies and stellar clusters. The problem is to infer the evolution- 
ary conditions that are compatible with the observed integrated luminosity 
Ltot and to establish the confidence of the different hypothesis. Following 
D'Agostini work From Observations to Hypotheses [8]: "it becomes crucial to 
learn how to deal quantitatively with probability of causes because the "prob- 
lem(s) in the probability of causes... may be said to be the essential problem(s) 
of the experimental method" (Poincare [10]).". 

Let us illustrate the problem in Fig. 1. We want to evaluate which model 
Hi, H2 or H3 is more compatible with an observation (black cross). In the 
left panel we show the situation where only the mean value of the model is 
taken into account. In this case, Hi is the closest model to the observation 
and it is the "best" solution. In the right panel we show the situation when 
the complete theoretical distribution is taken into account. The different con- 
centric circles represent 1, 2 and 3 a confidence intervals associated to each 
one of the models. Hi is still the closest model but the probability that the 
observation is compatible with Hi is lower than 1% (i.e. the observation lies 
outside of the 99% confidence interval of Hi). The observation is compatible 
with H2 in a 2a confidence interval of H2 and compatible with H3 in a 3<r 
confidence interval of H3. Hence, the "closest" model (following each model's 
metrics) is H2 . This second solution is different than the previous (precise but 
not accurate) one. 



Accuracy and precision 




is the most accurate model, 
although H 1 is the most precise one. 

Fig. 1. Precision and accuracy; see text for details. 

Summing up, since our main interest is to have accurate (and not only pre- 
cise) results, it is necessary to know the complete probability distributions of 
theoretical integrated luminosities and not only its mean values < L\^°{n) >. 
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The probability distribution of theoretical integrated luminosities provides a 
metric for goodness- of- fit to stellar population studies. Analysis method must 
obtain the closest model to the observation according with the metric defined 
by each model itself, and not by an "absolute" metric that does not take into 
account the intrinsic model dispersion. 

We illustrate in the following sections two different ways of using the metric 
associated to theoretical models. 



4 Distance-dependent analysis 

Let us first work with the variance of the distribution (a 2 in Eq. 4). The 
dispersion (cr) of the integrated luminosity is proportional to the square-root 
of the number of stars in the observed population, so we need to have an 
estimation of how many stars are in our resolution element, or, equivalcntly, 
the distance to the source. 




3000 3500 4000 4500 5000 5500 6000 6500 7000 

w.l. [A] 



1u ~ 60% CL 



5000 
w.l. [A] 



Fig. 2. Top: Integrated mean spectra of a solar metallicity 1 Ga old cluster with 10 5 
stars. Bottom: lo confidence interval of the relative dispersion of (L\— < Ll^°(n = 
10 5 ) >)/ < Ltt°{n = 10 5 ) >. Models from [9]. 



Figs. 2 and 3 show the mean energy distribution and relative dispersion of 
a model of a solar metallicity 1 Ga old cluster with 10 5 stars. The first thing to 
notice is that the relative dispersion increases almost by a factor two (at this 
age and metallicity) at red wavelengths. This happens because the stars that 
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Fig. 3. Detail of Fig 2 in the H5 wavelength range (left, with vertical lines showing 
the position of H<5, Ca II H + He and Ca II K lines) and the H/3 lines (right, with 
vertical line showing the position of H/3 line). 

contribute the most to the mean integrated luminosity at red wavelengths 
are also the most sparse ones and a small variation in the total number of 
such stars is translated to a large variation in the integrated luminosity. This 
situation can be reformulated by saying that infrared wavelengths have a 
lower effective number of stars than the visible (see [1, 2, 3] for a more detailed 
description and implications about the effective number of stars that contibute 
to a given wavelength). 

Second: line profiles have different dispersion degree (cfr. Fig. 3). The H/3 
line has an intrinsic higher dispersion than US. This means that the profile of 
HP is more sensitive than the profile of H5 to particular stellar populations 
in the cluster, so it can provide information about those particular stellar 
populations, but not about the overall population. 

5 Distance independent analysis 

As an alternative to the relative dispersion, we can use the mean-averaged 
dispersion defined in Eq. 5. The advantage of the mean-averaged dispersion 
is that it is independent of the number of stars in the cluster (hence, also 
independent of the distance) . Figs. 4 and 5 show the mean energy distribution 
and mean-averaged dispersion of a model of a solar metallicity 1 Ga old. In 
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this case, the mean spectra (top panels) is the one of an average individual 
star, as mentioned in section 2. 
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Fig. 4. Top: Integrated mean spectra of a 1 Ga old cluster. Bottom: 1 and 2a 
confidence intervals of the mean-averaged dispersion (-La— < L\^° >)/ \l < Ltot° >■ 
Models from [9]. 

As previously, the mean-averaged dispersion increases at red wavelengths. 
Absorption lines show a different behavior when expressed in this new units, 
but again, it shows similar conclusions: not all absorption lines can be fitted 
with the same precision. 

6 Conclusions 

Four conclusions can be drawn from this work: (a) the comparison of models 
and observations must be done taking into account the intrinsic metric of 
each model and not an absolute metric, (b) not all wavelengths must have 
the same weight in the fitting, (c) fits of line profiles have, in general, a larger 
dispersion than fits of the continuum, and (d) the more sensitive is the index 
(line profile, equivalent width, color...) about a particular stellar population, 
the less useful is the index to describe the overall evolutionary status of the 
population. 
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Fig. 5. Detail of Fig 4 in the H5 wavelength range (left, with vertical lines showing 
the position of H5, Ca II H + He and Ca II K lines) and the H/3 wavelength range 
(right, with vertical line showing the position of the H/3 line). 
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